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Abstract 

We study harmonic functions on random environments with particular emphasis 
on the case of the infinite cluster of supercritical percolation on Z rf . We prove that 
the vector space of harmonic functions growing at most linearly is (<i+l)-dimensional 
almost surely. In particular, there are no non-constant sublinear harmonic functions 
(thus implying the uniqueness of the corrector). A main ingredient of the proof is 
a quantitative, annealed version of the Avez entropy argument. This also provides 
bounds on the derivative of the heat kernel, simplifying and generalizing existing 
results. The argument applies to many different environments, even reversibility is 
not necessary. 

1 Introduction 

Since the work of Yau [71] in 1975, where the Liouville property for positive harmonic 
functions on complete manifolds with non-negative Ricci curvature was proved, the struc- 
ture of various spaces of harmonic functions have been at the heart of geometric analysis. 
Some years later, Yau conjectured that the space of polynomial growth harmonic func- 
tions of fixed order is always finite dimensional in open manifolds with non-negative Ricci 
curvature. Extensive literature has appeared on this conjecture and related problems. Un- 
derstanding progressed quickly (Yau's conjecture was proved by Colding and Minicozzi 
[21]) and gave birth to many tools (see [56] for an introduction to the subject). 

In algebraic setting bounded harmonic functions played a central role since the in- 
troduction of the Poisson boundary by Furstenberg [35, 36], see also the survey [70]. 
Recently, the geometric approach made a remarkable appearance in the algebraic realm 
when Kleiner proved that the space of harmonic functions with polynomial growth on the 
Cayley graph of a group with fixed polynomial volume growth is finite dimensional using 
the approach of [21]. He used this fact to provide a new proof of Gromov's theorem [51] 
(see [66] for a quantitative version of this theorem). 

Another place where harmonic functions have played an important role recently is in 
the proof of the central limit theorem on random graphs. A central element in the proofs 
(see e.g. [67, 62, 14, 38]) is the construction of a harmonic function h on the cluster which 
is close to linear — the term x( x ) = h(x) - (x,v) is called the corrector and once one 
shows that x( x ) = °(\\ x \\)i the proof may proceed. 
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The focus of this article is the case of random graphs. Classical tools of geomet- 
ric analysis do not extend to this context in a straightforward way. Indeed, a random 
environment is not regular at the microscopic scale. In order to understand harmonic 
functions, one thus needs to control the macroscopic behavior of the environment. Let us 
take supercritical percolation as an example (see [39] for background and definitions). 

For p € (0, 1), consider the random graph G = (V(G),E(G)) defined by V(G) = V(Z d ) 
and E(G) being a random set containing each edge of Z d with probability p, independently 
of the other edges. It is classical that there exists p c (d) e (0, 1) such that for p < p c (d), 
there is almost surely no infinite connected component (also called cluster), while for 
p > p c (d), there is a unique infinite cluster. When p > p c (d), we denote this cluster by cu. 

Theorem 1. Let d > 2 and let p > p c (d). Then with probability 1, the infinite cluster u 
has no non-constant sub-linear harmonic functions. 

This immediately shows that the corrector \ is unique, as was conjectured by Berger 
and Biskup [14, Question 3]. 

In more regular settings, claims of this sort have been proved using the following 
strategy: try to show that two random walks starting at neighbors will couple before 
time n with probability bigger than 1 - Ctit 1 / 2 . This fact is classical in the case of 
the hypercubic lattice Z d where an explicit coupling can be exhibited. In the random 
context it is not clear how to construct an explicit coupling, but a number of approaches 
in the literature allow to construct a coupling indirectly. The known Gaussian heat 
kernel bounds (see (1) below) allow to construct a coupling that will fail with probability 
n~ e . Using also the central limit theorem already mentioned, one could improve this to 
^-1/2+0(1)^ Nevertheless, getting the precise ttt 1 / 2 seems difficult with these approaches. 
The approach we will apply below not only gives the precise order ttt 1 / 2 , but the proof is 
also significantly simpler than those just suggested. 

The proof uses an entropy argument similar to Avez [2] who showed that a Cayley 
graph satisfies the Liouville property if the entropy of random walk on it is sublinear. In 
fact the "if" here is an "if and only if" as was shown by Kaimanovich & Vershik [46] and, 
with a different approach, by Derriennic [30], but we will not need the other direction. 
Two extensions were known before for this result: it applies to random graphs [9] and it 
can be quantified [34, Section 5]. It turns out that both generalizations can be applied 
simultaneously. Further, Theorem 1 is but an example: the techniques work in great 
generality, even reversibility is not needed. Only stationarity of the walk and some weak 
(sub-)diffusivity are used. Precise assumptions are detailed below. 

The environment as viewed from the particle. Before stating the results, we pro- 
vide a precise definition of what we mean by 'environment'. We are interested in envi- 
ronments which are somehow translational invariant. This notion extends the transitivity 
condition to the random context. 

Consider a Markov chain P over some set V (formally a function P : V x V -> [0, 1] 
where P(x,y) denotes the probability to move from x to y). We always assume our 
Markov chain is irreducible, i.e. that for any v ,w e V there is an n such that P n (v,w) > 0. 
A rooted Markov chain is a triplet (P, V, p) where p e V is some vertex that will be called 
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the root vertex. Two rooted Markov chains are considered isomorphic if there is a one- 
to-one map <f> : V -> V with 4>{p) = p' and P(x,y) = P'((f)(x),(f)(y)). An environment as 
viewed from the particle, abbreviated as simply environment is a random rooted Markov 
chain. Two environments are considered to have the same law if they can be coupled such 
that the resulting rooted Markov chains are isomorphic with probability 1. 

Definition 2. An environment (P,V,p) is called stationary if it has the same law as 
(P, V,Xi) where X\ is the position of the first step of the Markov chain P, starting from 
p, and independent of the random choice of (P,V, p). 

Stationary environments are very common and we provide ten examples in the end of 
Section 2. Most of these examples are embedded in Z d and for these we could have used 
a much simpler definition — there is no need to "couple the environments such that the 
resulting rooted Markov chains are isomorphic with probability 1". One could just say 
that they have the same law considered as a random function P : Z d x Z d -*■ [0, 1], after 
shifting p to (formally this is a stronger requirement, but all our examples embeddable 
into Z d have it). However, Examples 2.6, 2.8 and 2.10 are not embeddable into Z d , and 
we found no simpler definition that catches all examples of interest. 

A very important subset of stationary environments is given by environments V with 
the structure of a weighted graph (with the weight being a positive function v on every 
couple (v, w) e E, and on every couple (v, w) E). P is given by 

v(v,w) ^ . 

P(v,w) = — where v(y) - v{v, x). 

v{v ) x 

These environments will be called random stationary graphs. This particular type of 
Markov chains is also commonly called reversible. The reversible case has a rich theory, see 
e.g. [1, 9] where one can also find many more examples. To clearly distinguish between the 
reversible and non-reversible case, random stationary graphs will be denoted by (G, u, p) 
where G is the graph, v is the weight function and p is the root. 

The graph distance in G is denoted by d G (-,-) and the ball of size r centered at x by 
B!^(r). We will also consider this distance in non-reversible setting, where it is simply the 
smallest n such that P n (x,y) > 0. Since the distinction between annealed and quenched 
statements will be clear in the context, we will often drop the dependence on G in the 
notation. For instance, P^r, d G (-,-) and B^(n) will become simply P x , d(-,-) and B x (n). 
We collected all notations and conventions used in this paper in the last section of the 
introduction (page 8). 

Non-constant harmonic functions with minimal growth. Let P be a Markov 
chain with state space V. Then a function h : V -> R is called harmonic if h{X n ) is a 
martingale, or in other words, if 

h(x) = Y l p ( x ,y)h(y) 

y 

As already mentioned, harmonic functions have had a number of important applica- 
tions recently. Let us expand on the particular application in Kleiner's proof of Gromov's 
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theorem. It was known before Kleiner that to show Gromov's theorem it is enough to 
show that any group with polynomial volume growth has a non-trivial finite-dimensional 
representation. Kleiner showed that any group has a non-trivial linearly growing harmonic 
function, and that groups with polynomial growth have a finite dimension of those (even 
of polynomially growing functions). Since the group acts on harmonic functions on its 
Cayley graph by translations, this provides a finite dimension representation and proves 
Gromov's theorem. Shalom and Tao [66] showed that a quantitative version of Kleiner's 
proof can be performed. Further, they characterized the linearly growing harmonic func- 
tions — for groups with polynomial volume growth these are the non-constant harmonic 
functions with minimal growth [41, Theorem 6.1]). They showed (personal communica- 
tion) that when the group is nilpotent, any such function must be a character of the group 
(or the sum of a character and a constant), in analogy to the Choquet-Deny Theorem 
[20, 60]. For virtually nilpotent groups this holds mutatis mutandis. We plan to analyze 
harmonic functions with minimal growth in the context of Cayley graphs, especially of 
wreath products, in a future paper. 

We now return to the setting of this paper, i.e. of stationary random graphs. Using 
the entropy of the random walk, it is possible to bound from below the minimal growth 
of non-constant harmonic functions in terms of the rate of escape of the random walk. A 
particularly interesting case is the case of stationary environments with diffusive behavior, 
for which the bound is often sharp. A stationary environment (P, V, p) satisfies diffusive 
or sub -diffusive behaviour (SDB) if 

(SDB) There exists C > such that E (d(p, X n ) 2 ) < Cn for every n. 

where here and below E is the average over both the environment and over the walk (the 
so-called annealed average). We may now state our main result: 

Theorem 3. Let (P,V,p) be a stationary environment such that E(|B p (?7,)|) < Cn d for 
some constants C, d < oo independent of n. If (P,V,p) satisfies (SDB), then for almost 
every environment, there are no non-constant sublinear harmonic functions. 

As already stated, it applies to many different models, some of them significantly less 
well-understood than percolation. See a list of examples at the end of Section 2. 

Whether (SDB) follows from polynomial growth in the reversible case is an interesting 
question. The Carne-Varopoulos bound [19, 69] gives that E p (d(/9, X n )) < C\Jn logn 
which would give (with the same proof as that of Theorem 3, see Theorem 3' in Section 
2) that any stationary random graph with polynomial volume growth has no non-constant 
harmonic functions h with h(x) < Cd(p, x)ly\ogd(p, x). Without stationarity the Carne- 
Varopoulos bound \J n log n cannot be improved, as was shown by Barlow & Perkins [8]. 
Kesten gave a beautiful argument that a stationary random graph embedded in Z d satisfies 
(SDB), see e.g. [8, Section 2]. But it does not seem to apply just assuming polynomial 
growth. 

The relation between entropy, harmonic functions and speed of the random walk holds 
for more general environments (for instance with larger growth). We defer to Section 2 
for a more complete account of this question. 



4 



Polynomially growing functions. As in the case of manifolds, we are interested in 
the dimension of the space of harmonic functions with prescribed polynomial growth. Of 
course, one can encounter very different behavior depending on the environment (like 
in the deterministic case). Hence we will assume that our environments satisfy volume 
doubling and Poincare inequality. In this section we do not assume stationarity, and 
hence we do not need a root for our graph. The object of interest is therefore simply a 
random weighted graph (G, u). Here is the precise formulation of our assumptions on the 
environment: 

(VD) G (G, v) satisfies the volume doubling property (VD) G if there exist 
< Cvd < 00 such that for every A < 00 and for every x e B p (An), 
v(B x (2n)) < CvoKB^n)) for n large enough, where ^(B) is the 
total weight of the edges in the ball B. 

(P)g (G, v) satisfies the Poincare inequality (P)g if there exists Cp < 00 
such that for every A < 00, for every x e B p (An) and / : 'B x (2n) -> M, 

E (f(v) -7B,(n)) 2 Ky)<c P n 2 Y, \f(y)-f(z)\ 2 v(y,z) 

ycB x (n) (y,z)eE(B x (2n)) 

for n large enough, where 

u{B x {n)) yeBx(n) 

Cvd an d Cp may depend on the random choice of (G, v) (though we do not have any 
interesting example which actually uses this freedom). The minimal n from which the 
properties hold may depend both on the environment and on A. 

These properties are classical in geometric analysis. They go back to the theory 
developed by De Giorgi, Nash and Moser [63, 64, 65, 26] in the fifties and sixties for 
uniformly elliptic second-order operators in divergence form. In the deterministic context, 
they imply the Harnack principle and Gaussian bounds for the heat kernel. The versions 
above are tailored for the random case: they take into consideration that in most examples 
of interest these properties do not hold from every point since some unusual points always 
exist. So we require that they hold for balls which are not too far from our root p, 
relative to their size. This is reminiscent of Barlow's good and very good balls [4], but 
our requirements are much weaker, we only need the properties to hold for "macroscopic 
balls", balls whose distance to p is proportional to their size. 

Let us remark on the appearance of the number 2 in B x (2n) in both properties. 
For the volume doubling property it is clear that these properties are equivalent for all 
choices bigger than 1 i.e. if one were to define a "3- volume doubling property" then it 
would be equivalent to the "2-volume doubling property" defined above, though perhaps 
with different Cvd and minimal n. The same holds for the Poincare inequality, under the 
assumption of volume doubling. This is well known in deterministic settings, see e.g. [45, 
§5], and the proof carries over to the random case with no change. 

With these definitions we can state the following easy but, we believe, conceptually 
important theorem: 
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Theorem 4. Let (G,u) be a random weighted graph (not necessarily stationary). If 
(G,u) satisfies (VD) G and (P) G , then for every k > 0, the space of harmonic functions 
with h(x) < d(p, x) k for all x large enough, is finite dimensional. 

Further, the bound on the dimension depends only on Cvd and Cp. 

This theorem represents a randomized discrete version of Yau conjecture (except that 
the Poincare inequality must be assumed since it is not automatically satisfied). The 
proof of this theorem follows the existing path of [21, 27, 51, 66, 68]. Let us stress again 
that the interesting part is that is requires only macroscopic volume growth and Poincare 
inequality: the definitions of (VD) G and (P) G only examine balls of radius n inside 
Bp(An) for some finite A. 

Since the dimension depends only on Cvd and Cp, then in particular, if these constants 
are not random, neither is the bound. Thus, for example, in supercritical percolation 
there is a constant A (depending only on the dimension d and the probability p) such 
that Cvd ^ A and Cp < A almost surely (the minimal n is the only quantity which really 
changes between configurations). Hence for each k there is a number such that the 
dimension of harmonic functions of growth smaller than \x\ k is smaller than almost 
surely. 

Linearly growing functions. In the special case of environments which are modifi- 
cations of Z d , we can compare the dimension of harmonic functions with a prescribed 
growth to the dimension of harmonic functions on 7L d . The simplest perturbation of Z d is 
the supercritical cluster of percolation. We prove the following theorem: 

Theorem 5. For p > p c (d), let u be the unique infinite component of percolation on 7* d . 
Then, the dimension of the vector space of harmonic functions with growth at most linear 
on u is equal to d + 1 almost surely. 

This theorem must be understood as a first step towards a bigger goal, which would 
be to compute the dimension of all spaces with prescribed growth. 

The properties of the supercritical percolation cluster that we harness in this proof are 
quite general. First we use the <i-dimensional volume growth and the Poincare inequality 
(P) u , proved (in stronger form) by Barlow [3], as well as the Gaussian bounds which 
Barlow concludes from these. But the main ingredient of the proof is an invariance 
principle [67, 14, 62]. All these properties witness the close relation between macroscopic 
properties of the supercritical percolation cluster and R d . In some sense, it confirms the 
fact that this cluster is an approximation of Z d . 

Heat kernel estimates. Classically it is known that the kernels of symmetric diffu- 
sions have some Holder regularity. By 'classically' we refer to [63, 64, 65, 26]. In random 
environments, few results are known on Holder behavior: Conlon and Naddaf [23] and 
Delmotte and Deuschel [29] treated the case of random conductance with a uniform ellip- 
ticity condition, see also [38]. The entropy techniques developed for the proof of theorem 
1 allow to give a very short proof that the space derivative exists. Moreover, it applies in 
a very general context. We present the case of percolation. 
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Theorem 6. Let P p be the measure of the infinite cluster of percolation (denoted u) on 
Z, d . There exist C3, C4 > such that for every n > and x, x', y at distance less than n of 
0, x ~ x' , 

Ep[ (Pn(x,y) -p„-i(x',|/)) 2 l {y ^ xl&1J } 1 < -^T exp[-C 4 |x-y| 2 /^], 

where p n (y,x) := Pj / (X ra = x) and X n zs the random walk on oj. 

Estimates for the heat kernel itself (i.e. not for the derivative) are well understood, 
and are known as Gaussian estimates (GE). Heuristically, Gaussian estimates are bounds 
of the form 

C C 

-± exp [ - C 2 \x - y\ 2 /n] < P x [X n = y] < exp [ - C A \x - y\ 2 /n] . 

A few caveats are in place, though. The lower bound cannot hold if there is any kind of 
periodicity (as in Z d or in subgraphs of it, such as supercritical percolation). One should 
talk about continuous time random walk, lazy random walk, or replace P x pT n = y] with 
P x pT n = y] +P x [X n+ i = y\. Further, the lower bound does not hold for x and y extremely 
far away — if \x-y\ > t then the probability is just zero (in the simple random walk case). 

In the case of the infinite cluster of supercritical percolation, these bounds were ob- 
tained for continuous time random walk in [4]. They also hold for simple random walk, 
most of the details are filled in in [7]. Again, one should be careful, as (with small 
probability) the environment in the neighborhood of p might be atypical, breaking these 
estimates for small t. Hence the formulation is as follows. There exist strictly positive 
constants C±, C 2 , C 3 and C 4 such that for almost every environment co there exist random 
variables n x (u),x e 7h d so that for every x,y e u and n > n x (u), \x - y\. 

{ h ■ exp [ - C 2 \x - y\ 2 /n] < P x [X n = y] + P x [X n+1 = y] < exp [ - C 4 |x - y\ 2 /n]. (1) 



n d/2^l ' / J * C L '* y j Ti+L tii - nd/2 

Moreover, the random variables n x (u) satisfy a stretched exponential estimate i.e. 

F p (x eu,n x (u)> s) <ce' cs£ (2) 

for some e > 0. 

For the proof of Theorem 6 we only need the upper bound in (1). For the proof of 
Theorem 5 we will also need the lower bound, but only in the regime \x - y\ « \fn i.e. in 
the regime where the probabilities are « n~ d l 2 . 



Organization of the paper. In the next section, we study the notion of mean entropy 
of random walks on a stationary random graph to bound the total variation between 
random walks starting at neighbors. We deduce Theorem 3. Section 3 contains the 
proof that (VD) G and (P) G imply that the space of harmonic functions of prescribed 
polynomial growth is finite dimensional, i.e. Theorem 4. Section 4 deals with the example 
of the supercritical percolation cluster and analyzes the space of linearly growing harmonic 
functions. It is completely independent of Section 3. Section 5 contains the proof of 
Theorem 6. Section 6 regroups all the open questions. 
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Notations. To make the distinction between the reversible and non-reversible case clear, 
we call the general case "Markov chain" and denote it by (P, V), where V is the space 
and P : V x V -> [0, 1] are the transition probabilities, P(x,y) being the probability to 
move from x to y. We often write P n which we interpret as a matrix power — of course, 
P n (x,y) is also the probability that a random walk starting from x will be at y after n 
steps. 

The reversible case is called "graph" and denoted by (G, is) where G is a graph and 
v are weights on the edges i.e. v : E(G) -> [0, oo). Here and below we denote by E(G) 
the set of edges of the graph G, and for a set of vertices S we denote by E(S) the set of 
edges between the vertices of S. The notation x ~ y for two vertices will mean that they 
are neighbors in the graph. 

We also consider v as a measure. So for a vertex x we will denote v{x) = Ej/~rr K x >2/) 
while for a set of vertices S we will denote v(S) - T. X eS u (. x ) ■ Note that edges between 
two vertices of S are counted twice in this sum. 

For a fixed graph or Markov chain we denote by E the expectation with respect to 
the random walk on that fixed graph. When we want to note the starting point of the 
random walk we will use subscripts, i.e. E p . E is used to denote the expectation with 
respect to both the environment and the random walk (the "annealed" average). Similarly, 
bold letters will usually denote "quenched" objects, i.e. objects related to an instance G 
of the environment. The quantity d(x,y) will denote the graphical distance between two 
vertices x and y of G, i.e. the length of the shortest path in G between x and y, or, in the 
non-reversible setting, the minimal n such that P n (x,y) > 0. B x (r) will denote the ball 
{y : d(x,y) < r}. Cj will denote constants which depend on the environments G, while 
constants of the form Cj are uniform in the environment. We will occasionally write just 
c or C for a constant — different appearances of c or C might be different constants. 

Finally, for a random variable X we denote by Sf(X) the law of X, i.e. the measure 
on the space of values of X induced by it. If £ is some event then we will denote by 
St? (X\£) the law of X conditioned on £ happening. For a set E we will denote by |.E| 
the cardinality of E. 

2 The Entropy Argument 

The connection between entropy and random walks was first exhibited by Avez [2] and then 
made famous in a celebrated paper of Kaimanovich and Vershik [46] (see also Derriennic 
[30]). For any discrete variable X the entropy is defined by 

H{X) = Y,HP(X = x)) <Kt) = -tlogt, 0(0) = 0. 

X 

Consider a stationary environment (P, V, p) with law P. Conditionally on (P, V, p), define 
the entropy of the random walk started at x at times n, m by 

H™(P,V,p) = H(X n ,X m )= £ 4>(P p (X n = x,X m = y)) 

x,yeV 
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When n = m, we simply denote H"(P, V,p) by H„(P, V, p). In the random context, we 
define the mean entropy (see [9]) by 

H™ = E[H™(P,V,p)] and H n = E[U n (P,V, p)]. 

There are many ways of measuring the distance between two probability measures p 
and v on some set V, the most standard being the total variation ||/i-zv||?v : = Y, X zV IM^) - 
v(x)\. In this article, we will use a less standard one. Define A(p,u) by the formula 



A(p,u) := 



(fi(x)-u(x)Y 



E 



2T 1/2 



(3) 



Estimating the distance using A is stronger than via the total variation: by Cauchy- 
Schwartz, 

\\p - u\\ TV = £ \p(x) - u(x)\ = £ ^p(x) + v(x)^l=^ < 
x x \J pyx) + vyx) 



\ 



;?^) + K,))( ? ^f)^A ( ^) ( 4, 



This quantity has an advantage compared to the total variation: for any / : G -> R, we 
have (using Cauchy-Schwarz similarly) 

K/)-K/)|<A(/.^)(M/ 2 ) + K/ 2 )) 1/2 - (5) 

We will always be interested in the particular case of random walks. In order to lighten 
the notations, we set 



A n (x,y) : = A(j?(X n \X = x),^(X n _ 1 \X = y)) 
= A(S?(X n \X = x)^{X n \X 1 = y)), 



(6) 



the last equality following by the Markov property. Recall that J£{X\£) denotes the law 
of X conditioned on $ . Note that the second measure is the law of the random walk after 
n - 1 steps, so the definition is not symmetric in x and y. 

Theorem 7. Let (P, V, p) be a stationary environment. For every n > 0, we have 

E(A n (p,X 1 ) 2 )<2(H n -H n _ 1 ) (7) 

(as usual E is over both the environment and the randomness of Xi). 

Before proving Theorem 7, we state a result from [9] concerning H™. We isolate it from 
the rest of the proof because it is the only place where stationarity is used (stationarity 
replaces transitivity as used in the context of groups). 



Lemma 8. Let (P, V, p) be a stationary environment. For every n > 0, we have H" = 
H n -i + Hi . 
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Proof. Fix n > 0. We first use the conditional entropy formula H(X, Y) = H(X\Y)+H(Y). 
In our context this is simply 

HUP,V,p)= £ <f>(P p (X 1 = x,X n = y)) 

x,yeG 

= £ P P (Xi = x) Y, <P(P P (Xn = y\X 1 =x)) + Y 0(Pp(^i = x)) 

xzG ytG xeG 

which we simplify using the Markov property giving 

P p (X n = y\X 1 = x) = P x (X n _ 1 =y). 
Taking the expectation with respect to the environment we obtain 



H? = E 



YP P (x 1 = x)Y<P(P x (x n -i = y)) 

y^G 



x~p 



+ E 



Y 0(P,(*1 = X)) 



xeG 



= E [H^P, V, X,)] + E [Hx(P, V, p)] = H n _ x + E x 

where in the last equality we used the fact that (P,V,Xi) has the same law as (P, V, p) 
- formally, we use the coupling that makes them isomorphic with probability 1, and 
since a Markov chain isomorphism preserves the entropy, the expected entropy of the two 
environments (P, V,Xi) and (P, V,p) must be equal (this is not a property of entropy, it 
would be true for any function of the environment). □ 

Before continuing let us state one corollary of Lemma 8 which is not necessary for the 
proof of Theorem 7 but does shed some light on the quantities involved. 

Corollary 9. H n - H n -\ is decreasing. 

Proof. By Lemma 8, 

H n - H n _i = H n - H{ 1 + H 1 = E[H n - H™] + H 1 . 

and H n - H™ can be written as the conditioned entropy -H(X 1 \X n ) where X n is the 
random walk at time n (this statement is quenched). This, however, increases since 



H(Xi\X n ) = H(Xi\X n ,X n+ i) < H(Xi\X n+ i) 



(8) 



where the equality is due to the fact that conditioned on X n , knowing X n+ i gives you no 
information about what happened before time n. The inequality in (8) is a generic fact 
about entropy — conditioning on more information reduces the relative entropy. Hence 
H n - decreases, and so does its expectation. □ 

Returning to the proof of Theorem 7, let us recall another well-known fact. For any 
two random variables X and Y, H(X,Y) < H(X) + H(Y) with equality holding if and 
only if X and Y are independent. The next lemma gives a quantitative version of this 
fact. 
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Lemma 10. For any two random variables X and Y, 

£P(F = y)A 2 (J?(X),J?(X\Y = y)) < 2(H(X) + H{Y) - H(X, Y)). (9) 

y 

Proof. We start with the fact that for t > 

(t - 1) 2 

2tlogt>^ J - + 2t-2 (10) 

8 t + 1 y ' 

(This can be seen by Taylor expanding tlogt to the second order at 1.) For shortness 
denote 

p(x)=F(X = x) p(y) = F(Y = y) p(x,y) =F(X = x,Y = y). 
Then the left-hand side of (9) is (recall the definition (3) of A), 

LRS = Zp(y)Z (p(x,y)/p(y) ~ p(x))2 - 



p(x,y)/p(y)+p(x) 

= Yp(x)p(y) ( (K^)M^M?/) - 1) 2 + 2 p&y) _ 2 \ 

\ p( x ,y)/p( x )p(y) + 1 p{x)p{y) ) 

v » ' 



where we were allowed to add the expression denoted by ^ since summing over x and y 
makes both terms equal to 2 and cancel out. Thus, we get 

LHS < 2^p(x)p(y) log 



p{x)p{y) p{x)p{y)j 
= 2 £ p(x, y) {\ogp{x, y) - \ogp{x) - logp(y)) = 2(-H(X, Y) + H(X) + H(Y)) 



where in the last equality we used that Y, y p(x,y) = p(x) and Y, x p{ x iy) = P(y)- ^ 

Proof of Theorem 7. This is a direct corollary of Lemmas 8 and 10. Indeed, by Lemma 
10, ' 

E(A n (p,X 1 ) 2 ) = £P(X! = x)A(Sf(X n ),Sf(X n \X 1 = x)f 
<2(H 1 + H n -H?). 

We now take expectation with respect to the environment and get from Lemma 8 

E(A n (p, X x f) < 2E(Hi + H n - H") = 2{H n - H n ^). □ 

We are now in a position to prove Theorem 3: 

Proof of Theorem 3. We only need to prove that for almost every environment, h(p) = 
h(Xi) for any sublinear harmonic function. Indeed, stationarity would then imply that 
for almost every P, h(X n ) = h(X n+ i) for any sublinear harmonic function. Since the 
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Markov chain is irreducible, (X n ) can visit any vertex and we deduce that almost surely 
any sublinear harmonic function is constant. 

For any harmonic function h with respect to the environment we have 

h{p) = E p {h{X n )) h{X x ) = E Xl (h(X n ^)). 

Hence 

\h(p) - h(X 1 )\ = \E p [h(X n )] - E Xl [h(Xn-i)] 
By (5) < A n (p,X 1 )^E p [h 2 (X n )] + E Xl [h 2 (X n _ 1 )]. 

Taking expectation over Xi gives 

E p \h(p) - h(X 1 )\ < E p [A n (p, X^EplViXn)] + E Xl [h 2 (X n _ 1 )]] 

by Cauchy-Schwarz < \f2E p [A n (p,Xi) 2 ] E p [h 2 (X n )] (11) 

where in the last line we also used that E p [E Xl [h 2 (X n -i)] = E p [h 2 (X n )]. 

Now, assume that h has sublinear growth. First, for any e > 0, there exists a constant 
K such that for all x eV, 

h 2 (x) <ed(x,p) 2 + K. (12) 
Second, the Markov chain has annealed polynomial growth, therefore the entropy satisfies 

H n < E[log|B»|] < logE[|B p (n)|] < log[Cn d ] 

and is at most logarithmic (we use the fact that log is concave). Hence H n - if n _i < c/n 
for infinitely many n. Using Theorem 7 and (SDB) we get 

E[nA n ( / 9,X 1 ) 2 ] +E[n- 1 d(X n ,p) 2 ] < C for infinitely many n. 

Hence, by Fatou's Lemma, for almost every environment there exists Ci < oo such that 

EplnA^p^^ 2 ] + E p [n- 1 d(X n ,p) 2 ] < ci for infinitely many n, (13) 

where this time the sequence of n for which it holds depends on the environment, i.e. is 
random. Putting (12) and (13) in (11), we deduce that for almost every environment, 
and for every h harmonic and sublinear on it, 

E^^-hiX^Kc^ 2 . 

Letting e go to 0, we deduce that h(p) = h(Xi) almost surely for any sublinear harmonic 
function. □ 

Inequality (11) relates the entropy to the value of possible harmonic functions at X n . 
Its use is not restricted to the case of diffusive environments with polynomial growth. 
For instance, one can use this inequality to prove a characterization of almost sure Liou- 
ville property for stationary random graphs (this was proved in [9] using a more direct 
generalization of [46]). For completeness, we state the result in [9] here. 
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Corollary 11. [9] Let (P,V,p) be a stationary environment. If H n /n converges to 0, 
then P has the Liouville property (i.e. has no non-constant bounded harmonic functions) 
almost surely. 

We would like to emphasize why we use A(//, u). Csiszar's inequality [24, 25] relates 
the total variation between two measures to their relative entropy. In our context, an 
inequality involving the total variation can also be found, hence giving a bound on the best 
coupling (in time) between two random walks starting at neighbors. For completeness, 
we state the inequality here (it is a consequence of (4) applied to (7)): for a stationary 
environment (P, V, p) and n > 0, we have 

E (|| j2f(X n )-^f(X n |X 1 )||| y )<4(i/ n -#„_!). 

Interestingly, this inequality is not strong enough for our applications, since controlling 
the probability that two random walks merge before time n says nothing about their 
behavior when they do not couple. 

Examples. We finish this section by presenting a collection of examples. 

Example 2.1 (Random conductance). Consider the graph Z d and let v be given by a 
shift-invariant law (for example i.i.d. positive random variables). We assume that the 
set of sites connected by edges with positive conductances is infinite. The random walk 
induces a Markov process on the environment, cf. Kipnis and Varadhan [50], called the 
environment as seen from the particle. This process can be made stationary by weighting 
each configuration proportionally to u(p). 

This model has been studied extensively. Under the assumption of uniform ellipticity: 
3a > : P[a < u(x,y) < 1/a] = 1, many things are known on the environment. First, the 
Poincare inequality is a direct consequence of the Z d case. Second, Delmotte proved in 
[28] that the Poincare inequality implies that there exist Ci,C2 > such that 

P,[X„ = *]<^e-* 2 " 

(a corresponding lower bound also holds but is not needed for our purposes). Third, 
an annealed invariance principle holds in the sense that the law of the paths under the 
measure integrated over the environment scales to a non-degenerate Brownian motion 
[50]. In particular, Theorem 3 applies in this case. 

Once the assumption of uniform ellipticity is relaxed, matters get more complicated. 
An example of random conductance models without uniform ellipticity is the infinite 
cluster of percolation which we will discuss next. For an unusual example of a transitive 
conductance model, see the work of Disertori, Spencer and Zirnbauer [32] who reduced a 
supersymmetric hyperbolic sigma model to the study of random walk on a certain (highly 
correlated) random environment. 

Example 2.2 (Infinite cluster of percolation). Consider the percolation measure with a 
parameter p such that there exists an infinite cluster with probability 1. See [39] for details 
about percolation. Set Pq to be the law of the infinite cluster conditioned to contain 0. 
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As in the previous example, the random walk on to induces a Markov chain on the space 
Q of infinite subgraphs of Z d containing the origin. When weighting each configuration 
proportionally to the number of neighbors of the origin we obtain a stationary measure 
with respect to the shift along the random walk. 

Since the infinite cluster of percolation can be seen as a stationary random graph 
with polynomial volume growth and since the random walk is diffusive [49, 4], Theorem 3 
applies, and we get Theorem 1. 

Example 2.3 (Centered random environments). This is our first non-reversible example. 
A centered random environment is, roughly speaking, a Markov chain on Z d such that 
the probabilities can be "decomposed" into a sum over cycles. Such environments, even 
when non-reversible, are still heuristically quite close to reversible, and in particular they 
have a stationary version which is related to the usual version by an explicit reweighting, 
like in the reversible case [31, Section 3]. See Deuschel and Kosters [31] for a proof of a 
CLT, which implies (SDB) — of course, a CLT is much stronger than (SDB) — and 
hence our results apply. 

Example 2.4 (Balanced random environments). This is another non-reversible example, 
which is "farther" from reversible than the previous one. A balanced random environment 
is a Markov chain P with state space Z d and nearest neighbor movements, such that 
for every x e Z d and every unit vector e^, P(x,x + e^) = P(x,x - e^). It follows that X n 
is a martingale, and hence (SDB) is an immediate corollary of the Azuma-Hoeffding 
inequality. The issue is therefore only stationarity. In the case that the environment /i 
is uniformly elliptic and stationary and ergodic to the action of Z d (this is different from 
our notion of stationarity!), Lawler [54, Theorem 3] showed that there exists a stationary 
measure (in our sense) A which is mutually absolutely continuous with respect to /j. Hence 
our results apply to A, and hence also to /i. Guo and Zeitouni weakened the requirement 
of uniform ellipticity to just ellipticity, at the price of restricting the environment to 
the i.i.d. case [40]. Berger and Deuschel [15] have removed the requirement of ellipticity 
altogether. 

Example 2.5 (Random environments with cut points). Under certain conditions, one can 
prove that a random walk in non-reversible random environments in Z d , d large enough, 
has cut points, and deduce from that a CLT and the existence of a stationary environment, 
hence our techniques apply. See [17] for the details. 

Let us give one example which is not embedded in Z d , and in fact has unbounded 
degrees. 

Example 2.6 (Poisson point process). Examine a Poisson point process in M, d . Add the 
point (this is often called "the Palm process") and let it be the root. Construct a graph 
by some process invariant under translations of R d . For example, connect any two points 
by an edge with weight which depends on their Euclidean distance [18] or construct the 
Delauney triangulation [37]. Give each configuration a "probability proportional to the 
total weight of 0". The resulting process is stationary and diffusive (see e.g. [18, §2.1] or 
[37, Lemma A.l] for stationarity — subdiffusivity can be deduced from [8, Section 2] or 
from the two previous papers). Hence our theorem applies. 
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Figure 1: A portion of the graphical Sierpinski gasket. 

The previous examples dealt with random walks which are diffusive. An interesting 
situation, which cannot hold in the case of groups, is environments with subdiffusive 
behavior. We give four examples of these. 

Example 2.7 (Graphical fractals). A graphical fractal is a graph which is constructed like 
one of the classical fractals (the Sierpinski gasket, for example) but inside out — bigger 
pieces of the graph are constructed from smaller pieces by connecting them in a repeated 
fashion (see [5] for precise definitions and main properties). See figure 1 for an example, 
the graphical Sierpinski gasket. A graphical fractal always has an invariant measure and is 
always diffusive or subdiffusive, and in many examples is in fact subdiffusive, see e.g. [3]. 
Let us remark that a significant part in the remarkable work of Barlow and Bass on the 
Sierpinski carpet [6] has to do with the construction of a coupling, so a tool (like the one 
described in this section) that gives easy proofs that couplings exist should be useful. 

Example 2.8 (Critical Galton- Watson trees). The critical Galton- Watson tree with any 
offspring distribution conditioned to survive is stationary (see [47, 59, 58]) and subdif- 
fusive. If the offspring distribution has finite variance, the diffusivity exponent y,3 was 
proved in [49]. 

Example 2.9 (Infinite Incipient Cluster). Consider the critical percolation on 7L d condi- 
tioned on the fact that the origin is connected to infinity [48] — one cannot really condition 
on this event which has probability (proved in d = 2 and high d and conjectured in the 
others), but the object can be defined properly using a limit process. For example, one 
may take p c + e percolation, condition on being in the cluster and then take a limit of 
the resulting measures as e -*■ 0. Since for each e the measure is stationary (as usual after 
reweighting the configurations proportionally to the degree of 0), so will be their limit if it 
exists (or any subsequence limit in general). The limit is known to exists in 2 dimensions 
[48, 44] and in high dimensions [43, 42]. It was proved in [49, 52] that the random walk 
is subdiffusive on this cluster (in high dimension the diffusivity exponent is ^3, as on the 
tree). Since it is embedded in Z d , it grows no faster than polynomially. Therefore, there 
are no linear growth harmonic functions on it. 

Example 2.10 (UIPQ). Let G n be fixed or random finite graphs. Take p n to be a random 
vertex in G n , selected according to the stationary measure on G n . Then the limit of 



15 



(Gn,Pn), if it exists, is called the graph limit [13]. This limit is always stationary [53, 
Section 1.3]. 

A particular case is when G n is a planar quadrangulation with n faces, chosen uni- 
formly. The graph limit is known as the uniform infinite planar quadrangulation. It was 
proved to be subdiffusive with diffusivity exponent bounded from above by in [10]. 
Thus, there are no linear growth harmonic functions in this case as well. 

Other growth rates. Examining the proof of Theorem 3 the same argument can also 
be used with growth rates bigger then polynomial. A general statement would be 

Theorem 3'. Let (P,V,p) be a stationary environment. Then there are no non-constant 
harmonic functions h which satisfy 

E p [h(X n ) 2 ]-(H n -H n ^)^0 

even if this holds just along a subsequence of n. 

In particular this holds for fixed transitive graphs, which is a version of a result of [34, 
Section 5]. 

A remark on connectivity. We assumed throughout that the environment (P, V, p) 
is irreducible, i.e. that for any v,w e V there is some n such that P n (v,w) > 0. This 
assumption was only used once: we showed that a not-necessarily-irreducible stationary 
environment satisfies that every harmonic function h has h(p) = h(Xi) almost surely, and 
concluded, using irreducibility, that h is constant. The assumption of irreducibility is 
of course necessary, as a disconnected graph always has bounded non-constant harmonic 
functions, namely functions which are constant on each component, but with different 
constants. 

Nevertheless, in the non-reversible case, the assumption of irreducibility can be weak- 
ened slightly: we only need to assume that for every v and w there exist n, m and x 
such that P n {v,x) > and P m (w,x) > 0. The proof is the same — since h(p) = h(Xi) 
almost surely then this gives that h{y ) = h(x) = h(w) almost surely and h is constant. 
An example of a stationary graph satisfying this is as follows: take a 3-regular tree T, 
choose a height function I (i.e. a function such that each vertex has one neighbor with £ 
bigger by one, and two neighbors with £ smaller by one), and orient all edges "up" i.e. in 
the direction of the larger I. Of course, the random walk on the resulting graph is so 
degenerate it can hardly be called random, as each vertex has only one outgoing edge. 
But this is irrelevant at this point. This environment is not irreducible in the usual sense, 
but does satisfy the weaker assumption and hence our results apply (again, in this case it 
is simple to analyze the harmonic functions directly). Multiplying by Z will give a slightly 
less trivial example. 

3 Polynomial growth harmonic functions 

In this section we prove Theorem 4. The proof boils down to the observation that macro- 
scopic Poincare inequality and volume growth estimates are sufficient. The strategy fol- 
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lows the lines of Shalom and Tao [66, 68], where a quantitative version of Gromov's 
celebrated theorem on groups of polynomial growth (any group of polynomial growth is 
virtually nilpotent) is proved. The proof is inspired by an elegant proof of this theorem 
due to Kleiner [51] harnessing spaces of harmonic functions with polynomial growth in a 
crucial way. We start with a very general inequality, called the reverse Poincare inequality, 
which holds in any graph. For the sake of completeness, we prove it in our context. 

Proposition 12 (Reverse Poincare inequality). For any weighted graph (G,v) and any 
function h: G ->M. harmonic on a ball ~Q x (2n), 

£ (h(z)-h(y)fv(y,z)<- 2 £ Kyfu{y). (14) 

(y,z)eE(B x (n)) U yeB x (2n) 

for every x e G and n > 0. 

Proof. For this proof, we denote the quantity f(x) by f x . Let h ■ B x (2n) -> R be harmonic 
and let be a function such that <p y = 1 for y e B x (n), 4>y - for y £ B x (2n - 1) and 
\<f> y - 4> z \ ^ l/n> f° r all y ~ z. For example, 

d(y, x) 

mm ( ' 



We have 



E (hy-h,) 2 u(y,z)= Z M + 4> 2 z){h y -h z fv(y,z). (15) 

E(B x (n)) E(B x (n)) 

To make the calculation a little shorter we represent the sum on the right-hand side of 
(15) as a sum of \4>1{h y -h z ) 2 over directed edges. Denote by E* the set of directed edges 
in B x (2n) i.e. both (y,z) and (z,y) appear in E* and are different. For an edge (y,z) € 
E*, an easy computation shows that (f)y(h z - h y ) 2 equals the lengthy yet straightforward 
quantity 

(h z (f) 2 z - h y (j) 2 y){h z - h y ) - h z {(j) z - (f> y ) 2 (h z - h y ) - 2h z <p y {4> z - (j) y ){h z - h y ). 

We start by dealing with the first term. Integration by parts and the fact that h(f> 2 vanishes 
on the boundary of T5 x (2n) imply 

Y,(h z (f)l-hy(f) 2 y)(h z -hy)is(y,z) = 2 Y, My \Y,( h v- h zMz,v)) 

E* yeB x (2n) \z~V I 

Since h is harmonic, this sum equals 0. 

For the second term, since \h z (h z - h y )\ < \h 2 z + \h 2 and \<p z - <p y \ < 1/n, we have that 
each summand is bounded by (3h z + h 2 )/(2n 2 ). When summing over E* we obtain 



Y, h -Mz - (p y ) 2 (h z ~ h y )u(y, z) 

E* 



' l B x {2n) 
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For the third term, note that 

\h Z (Py(<P Z ~ <f)y)(h Z ~ hy)\ < ^(hy - h Z ) 2 (j) 2 y + hl((f) Z - (j)y) 2 . (l6) 

So, 

Y, \h z (j) y ((p z ~ (p y ){h z - h y )\u(z,y) 

E* 

By (16) < W( h y-h z ) 2 <p 2 y v(y,z) + Y l h 2 z (<j )z -0y) 2 v(y,z) 

4 E* E* 

* \T J {h y -h z f4>lv{y,z) + - 2 £ h 2 y v(y) 

E* 71 B x (2n) 

using the bound \(f> z - <p y \ < \ for every y ~ z. Putting the bound on the different terms 
together leads to 

1 4 
Y,( h v - h z ) 2 (f> 2 y v(y,z) < - Y,(h y - h z ) 2 (f) 2 y is(y,z) + — ^ h 2 y u(y). 

E* E* 71 & x (2n) 

which gives 

Y, {K-hy) 2 dv(y,z)<\YXh z -hy) 2 <\) 2 y v(y,z)<— 2 ^ h\v{y). □ 

E(B x (n)) 1 E* n B x (2n) 

Lemma 13. Let (G, u) be a random graph satisfying the volume doubling condition 
(VD) G almost surely. Then there exists c > such that for any finite X and n large 
enough, there is a covering of the ball B p (An) by less than M\ balls TS yi (n), TS yk (n) 
satisfying that every point x e B p (n) belongs to at most c balls B yi (2n). 

Further, c depends only on the volume doubling constant Cvd ; and M\ depends only 
on A and Cvd- 

We call a covering with this property proper. 

Proof. Let A and G be as above. Let n be large enough so that (VD) G holds for our A. 
Given this, we can choose a maximal family of disjoint balls B yi (n/2), . . . , H yk (n/2) with 
yj € Bp (An) for all j. 

• Since the family {B yj (n/2)} is maximal, every vertex in B p (An) must be within 
distance < n from one of the yj, so B p (An) is covered by ~B yi (n), . . . , H yk (n). 

• For any x £ B p (An), if x e B % .(2n), then B % (n/2) c B x (3n). Using volume doubling 
we see that u(B x (3n)) < Cy D i/(B % . (n/2)), hence (since these balls are disjoint) we 
have that the number of yj such that x e H y (2n) is at most Cy D . 

• We use volume doubling similarly and get 

KB p ((A + l)ra))<Ci/(B w (n/2)) Vj. 
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(the constant is + l+ ). Since these balls are all disjoint and fully contained 

in B P ((A + l)n), we get 

kmmu(B y .(n/2)) < z/((jB % (n/2)) < u(B p ((X + l)n)) < Cmmv(B y .(n/2)) 

j 3 

and we get that the number of balls k is bounded by the same C. □ 

Lemma 14. Let (G, u) be a random graph satisfying (P)g almost surely. For almost 
every G, there exists c > such that for every e > and n large enough, and for every 
proper covering of B p (n) by balls of radius en, if h : G -» 1R. is harmonic and has mean 
on all the balls of the covering then 

Y h(z) 2 u(z)<ce 2 Y, H z ) 2 Hz)- (17) 

B p (n) B p (4n) 

Further, c depends only on Cp, the constant in the Poincare inequality. 

Proof. Let G be an environment and fix n large enough so that (P)g holds true for A = 1/e 
and en: for every x e B p (n) and / a map on B p (n), 

Y (f(v) -7 B , (£ n)) 2 K?/)<Cp(en) 2 Y \f(v)-MMv,z)- 

yeB x (en) (y,z)eE(B x (2en)) 

Consider h : G -> K. a harmonic function and B^ (en), . . . , B^ fc (en) the proper covering 
of B p (n) from the statement of the lemma. The hypothesis asserts that /iB„.(en) = for 
every i, so that Poincare inequality implies 

Y h\z)u{z)= Y {h{z)-h* yi{£n) fv{z) 

B yi (sn) B yi (en) 

< C P e 2 n 2 Y (h(z)-h(t)) 2 »(z,t). 

E{B Vi {2en)) 

Since the B^. (en) have uniformly bounded overlap (each point belong to at most c balls) , 
and since ~Q y .{2en) c B p (2n), we find 

Y h 2 (z)u(z) <cC P e 2 n 2 Y (h(z) - h(t)) 2 u(z,t). (18) 

Bp(n) E(B p (2n)) 

Using the reverse Poincare inequality (Proposition 12) for the larger ball, we conclude 

Y h 2 {z)u{z) <4cC P e 2 Y h 2 (z)v(z) (19) 

B p (n) B p (4n) 

which implies the claim with c = 4cCp. □ 

Proof of Theorem 4- We aim to prove that the space of harmonic functions u such that 
|m(x)| < Cd(p, x) k for every x e G is finite dimensional. Consider an environment G 
satisfying (VD) G and (P) G - Let c 2 and c be large enough so that the two previous 



19 



lemmas hold true. On the set of harmonic functions on B p (n), a scalar product between 
two functions can be defined by 



(Mn= E f(x)g(x)u(x). 

B P (n) 

Consider d harmonic functions u±, . . . ,Ud on G and set V = span(«i, . . . , Ud). 

Let £ > be some parameter to be fixed later. For n large enough, there exists a 
proper covering H yi (en), . . . ,B yM (en) of B p (n) by M = M\i e balls. Therefore there is a 
codimension d-M vector space Vq c V of harmonic functions with mean on each of the 
balls H yi (en). Let v±, . . . , Vd be an orthogonal basis of V for (v) 4n such that v\,.. . ,va-M 
is a basis of Vq. Examine the Gram matrix of V{ i.e. the d x d matrix whose entries are 
(vi,Vj) n . Then 

d 

det[{{vi,Vj) n } itj \ <Y\(vi,Vi) n 

i=i 

d-M cZ 

< n c£(^,^) 4n n (vi,vi) 4n 

1 i=d-M+l 

= (cs) d - M det[{{v t ,v J ) 4n } hJ l 

where in the first line we have used Hadamard's inequality, in the second Lemma 14 
and in the last, the fact that (fj) is orthogonal for (•, •)/!„. Now, the ratio of two Gram 
determinants is preserved by linear operations on vectors, so we can return from the basis 
{vi} (which was specific to n) to our "original" basis {ui}. We get 

det [{(«i,%)n}jj] < (ce)^ M det[{(uj,w i ) 4 n}j,i]- 

Iterating the reasoning, we find for every r > 

det[{(^,M j ) n }i ij ] < [(c6:)^ M ] r det[{(Mi,M j )4-„}ij]. 

Since every entry of the matrix is smaller than C(4: r n) k thanks to the bound on the 
growth of the harmonic functions, we find 

det[{(u i ,u i ) n } ij -] < d\n kd C d ((ce) d - M i kd y. 

We now fix e to be 4r 2fc /c. If d > 2M then this would imply 

(cs) d ~ M A kd = 4r 2k ( d ~ M ) +kd < \ 

and the right hand side would converges to 0. We deduce that det [{{ui, Uj) n }ij~\ = 0, and 
that the Ui restricted to the ball of radius n form a dependent family. Since this is true 
for every n large enough, we easily deduce that (u^ is a linearly dependent family. The 
result holds for any family of d harmonic functions with growth bounded by Cd(-,p) k . It 
implies that the dimension of the vector space of harmonic functions with such growth is 
smaller or equal to 2M. □ 
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Example 3.1 (Infinite cluster of percolation). The infinite cluster of percolation satisfies 
(VD) U and (P) u almost surely [4]. Therefore, spaces of harmonic functions with pre- 
scribed polynomial growth are finite dimensional. 

Example 3.2 (Random conductance). Random conductances with uniform elliptic con- 
ditions also satisfy (VD) u and (P) u deterministically. Therefore, spaces of harmonic 
functions with prescribed polynomial growth are finite dimensional. 

Example 3.3 (Wedges). Let / be some slowly varying function from [0, oo) -> [0, oo). 
Define the wedge with respect to d and / to be 

W:= {xeZ d :\x d \<f(\x 1 \ + - + \x d _ 1 \)}. 

Then it is well-known and not difficult to see that W (with the graph structure inherited 
from Z d ) satisfies volume doubling and Poincare inequality. Under some weak condi- 
tions on / and d (which we will not detail here, as that would take us too off-topic) so 
would percolation on W. Hence both W and supercritical percolation on it have a finite 
dimensional space of harmonic functions. 



4 Linearly growing harmonic functions on the infinite 
cluster of percolation 

In this section, we fix d > and p > p c (d). The infinite cluster of supercritical percolation 
can be thought of as an approximation of Z d . In particular, macroscopic properties of the 
cluster are the same as those of R d . For instance, the random walk satisfies an invariance 
principle (CL7% [67, 14, 62]: define 



B n (t) := 




where for non-integer tn we define X tn as the linear interpolation between X\ tn \ and 
X^ tn y, that is, X tn = Xy tn ^(tn - [tn\) + X^ tn ~\(\tn~\ - tn). There exists a(d) such that 
the law of (B n (t),0 < t < oo) converges weakly to the law of a Brownian motion with 
variance o~(d) as n -> oo. The main step in the proof in all three papers [67, 14, 62] is 
the construction of a rf-dimensional space of harmonic functions {f v } wR d such that /„ has 
slope v i.e. f v (x) = (v,x) + o(\x\). Let us state this as a theorem. 

Theorem 15 ([67, 14, 62]). Let d > 2, and p > p c (d). Let u be the infinite cluster of 
percolation on Z d with parameter p. Then, for almost every oo there exists \ : u -> M. d such 
that x h> x + x( x ) i s harmonic on u, and 

lim — sup |y(x)| = a.s. (20) 
n ^°° n xeBp (n) 

This (random) function is called the corrector. 

With the constant functions, we get a (d + l)-dimensional space of harmonic func- 
tions with (sub-)linear growth. Our aim in this section is to prove Theorem 5 from the 
introduction, namely that there are no other harmonic functions of linear growth. 
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Proof outline. Let h be a harmonic function with linear growth. Define h n : M. d -> R 
such that h n (x) = h(nx)/n. In order to prove Theorem 5, we first show that (h n ) forms 
a precompact family (one can say that h has a scaling limit). The second step is to 
identify the possible limits. For this, we use the average property at the discrete level 
and the invariance principle to prove that limits are harmonic on IR d . If the space of 
limits is at most d-dimensional, one can then use the absence of non-constant sublinear 
harmonic functions to show that the space of harmonic functions with linear growth is 
(d + l)-dimensional. 

Properties of the supercritical cluster. Recall that the infinite supercritical cluster 
of percolation to can be seen as a stationary random graph with polynomial growth. It is 
well-know that the system is ergodic with respect to the shift by Xi, see e.g. [14, Theorem 
3.1]. Typical balls have the same growth as in the ambient space Z d in the following sense: 
there exists constants c and C such that for any finite A, any n sufficiently large and any 
x € Bp(An) 

cn d < u(B x (n)) < Cn d . (21) 

Clearly, (21) implies volume doubling (VD) w . Moreover, the graph satisfies (P) u almost 
surely. Both properties were proved by Barlow [4]. Actually, Barlow proved quantitatively 
stronger versions of (21) and (P) w '- he obtained the volume growth estimates and the 
Poincare inequality for every ball of radius larger than Clogn in B p (n). These improved 
versions allow to prove Harnack inequalities and Gaussian estimates (1) on the heat kernel. 
In [4] these results are stated for continuous time random walk, but they hold also for 
simple random walk, as was explained in [7, Section 2]. We do not need the full force 
of Gaussian estimates here — in particular we do not need far off-diagonal lower bounds 
which are particularly difficult — so let us make a list of corollaries from these Gaussian 
estimates which we will use. 

Corollary 16. For every finite X, every n sufficiently large and every x e B p (An) ; 

P,(X„ 2 = y) < Cn d exp [-C\x - y\ 2 /n] My. (22) 
This immediately implies 

E x [\X n 2 - x\ 2 ] < c 3 n 2 (23) 

for some constant C3 depending on the environment. 

The lower bound has some periodicity requirements, because clearly P x (X t = y) = 
whenever t + Y,( x i ~ Hi) is °dd. 

Corollary 17. For every finite X, every n sufficiently large and every x e B(Xn), 

P x (X n 2 = y) > Cn~ d My e T5 x (n) such that n 2 + - yi) is even (24) 

We start with a technical lemma which allow to move estimates at p to estimates at 
most points. The main ingredient is ergodicity. 
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Lemma 18. Let f(x,y,oo) be some positive translation-invariant variable i.e. f(x + s,y + 
s,u + s) = f(x,y,oo), with M := E/(0,Xl,u;) < oo. Then for every A > and for almost 
every environment u, 

E f(x,y,oo)v(x,y)<C-M-n d 

(x,y)eE(B a (n)) 

for n sufficiently large, and for all a e B p (An). 

Proof. We will not denote the dependency of / on w from this point on. By ergodicity 
[14, Theorem 3.1], for almost every oo, \ Y,t=o f(X t ,X t+ i) converges in L 1 to M (L 1 here 
is with respect to the measure P p ) . Therefore 

i k— 1 

- £ E p [f(X t , X t+1 )] < 2M for k sufficiently large. (25) 
t=o 

Moreover, the heat kernel lower bound (24) implies that there exists C > such that for 
every t sufficiently large and every x e B p ( v / t) with t + £ Xi even, 

P p [X t = x]> ( 



which we sum over t to get, for x ~ y e B p (fc) and A; sufficiently large 



1 2 £„ r „ „ , G 



— ^P p [X t = x,X t+1 = y]> 
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Hence, 

|f> ( h.\\ i 2k 2 (25) O /Lf 

E f(x,vM*,v) * ■ ^EW^w)] * 7T-|B P (fc)| (26) 

(x,l/)€B(B p (fc)) °4 AK *=0 °4 

for A; sufficiently large. Fix some A; such that the probability that (26) holds for k > ko is 
bigger than |. Call a point 6 e oo good if 

(x )W )€E(B 6 (fc)) U 4 

We know that p is good with probability greater than |. 
By ergodicity again, for almost every environment oo, 

■y k-l 

T E 1 {Xt is good} — ► P[p is good] P p - almost surely (27) 

^ t=o 

when k goes to infinity. Now, consider an environment oo and fix 5 > 0. The maximum 
distance between two good points in the box B p (n) behaves like o(n). Indeed, if it was 
not the case there would exists e > such that with positive P p -probability, the random 
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walk X would not visit a good point during an interval en before time n for an infinite 
number of n (here we used implicitly the invariance principle to see that the walk has 
positive probability to reach that "bad ball", and stay there). This would prevent the 
left hand side in (27) from converging P p -almost surely as during these intervals the sum 
E l{x t is good} would not increase while k would increase by a factor of 1 + e, causing the 
average to "plunge". This cannot happen infinitely many times for a converging sequence. 

Therefore, for every a e B p (An), there exists a good point b at distance less than n. 
Thus, 

£ f(x,y)v(x,y) < £ f{x,y)v{x,y) 

(x,y)eE(B a (n)) (x,y)eE(B b (2n)) 

2M (21) 
<=^-\B b (2n)\ < CMn d . □ 

Recall the h n from the proof sketch on page 22. There we defined h n (x) = h(nx)/n 
which is a priori only defined on the contracted infinite cluster. For simplicity let us 
extend it to all R d , e.g. by extending h to Z d by taking the value at the closest point 
of the infinite cluster, and then to R d by dividing each square (n,m) + [0, l] 2 into two 
triangles and interpolating on the triangles linearly. Once h is extended to all R d , so is 

Proposition 19. For almost every environment to, any harmonic function h on u with 
linear growth satisfies that for every compact K c M. d , the sequence (h n )\K is uniformly 
bounded and equicontinuous. 

Proof. Fix a harmonic map h with (at most) linear growth on an environment to. There 
exists A > such that < A\x\. We only need to prove equicontinuity on the ball 

(other compact sets K work the same). To do so, we prove that for any 7] > 0, there 
exists S > such that (h(a) - h(b)) 2 < nn 2 for any two points a,b e B p (n) at distance 5n 
of each other, when n is large enough. For this reason, we will always assume that n is 
large enough so that (P) w and the rf-dimensional volume growth (21) hold true for an 
appropriate A. 

Let 5, e > to be fixed later (think about e « 5) and a, b e B p (n) with d(a, b) < Sn. Let 
B be some ball of radius 2Sn containing both B a (Sn) and B&(5n) — for example around 
the middle point of [ab]. Let h be the average Y,xeB h(x)v(x). Since \h(a) - h(b)\ < 

\h(a) -h\ + \h(b) -h\, it is enough to estimate these terms, and we estimate \h(a) - h\. The 
other term is symmetric. 

Set £ to be the event that \X( en )2 -a\> Sn. Note that 

Ea(\X {£n) .-a\ 2 ) (23) C3 (eny 2 

P a(^)< 7T-^ < , x x 2 =C 3 (g/d) 

(on) z (on) z 

where the Markov inequality was used in the first inequality and the quenched diffusive 
behavior (23) in the second. 
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Now, we have 



\h(a)-h\ 2 <(E a [\h(X (£n)2 )-h\ 

<2(E a [|/ i (X (£n)2 )-I|l # ]) 2 + 2(E a [|/i(X (£n)2 )-I|l # c]) 2 

We first deal with the first term on the right: 

(E a [\h(X (en) 2) -h\U]f < E a [(\h(X (£n)2 )\ + \h\f] • P a (<f ) 

<(2E a [/ i (X (£n)2 ) 2 ]+2^ 2 )-P a ( ( r) 
Since h(x) < A\x\ < 2A(E a [|^( e „)2| 2 ] + (1 + S) 2 n 2 ) ■ P a (<f ) 

by (23) < 2A(c 3 (l + e 2 ) + (1 + S) 2 )n 2 • c 3 j 2 = c 5 n 2 ^ 

where in the first inequality we have used Cauchy-Schwarz. 

For the second term, the heat kernel upper bound (22) shows that P a (X( £n ^2 = x) < 
Ce/(en) d for any x e B p (n) and n large enough. Therefore, 

(E a [\h{X {£n) .) -h\l g c]) 2 < E a [\h{X {enY ) -h\ 2 Uc] 

< - — ^— ^ \h(x) - h\ v(x) 

\ £n ) xeBJSn) 



/ ^\ y \h(x) - h\ u(x). 

Wis 1 1 



Harnessing Poincare inequality, we find 



M\ h (X i£ nr)-h\l^f<^%S 2 n 2 £ \h(x)-h(y)\Mx,y), 

\ £n ) (x,y)eE(B') 

where B' is the ball with same center as B and radius A5n. 

Now, the quantity A n controls the gradient of an harmonic function. Indeed, the same 
reasoning as the one used to derive (11) implies that 

\h{x) - h(y)\ 2 < (E x [\h(X n )\ 2 ] + E y [\h(X n )\ 2 ) A n (x, y) 2 
<A 2 {E x [\X n \ 2 ] + E y [\X n \ 2 ])A n {x,yf 

for every n. Using diffusivity and taking the liminf, we obtain 

\h(x) - h(y)\ 2 < c 7 liminf ^/nA n (x,y) 2 

n—><x 

where c 7 does not depend on the points x,y (though it does depend on h through A). 
Denote this liminf by A 00 (x,y) 2 . We get 

(E a [\h(X (£n)2 ) -h\l^]f < E c 7 A x (x,y) 2 v(x,y). 

6 n (x,y)eE(B) 
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We next note that ~EA 00 (p, Xi) 2 < oo. Indeed, the infinite cluster of percolation is a sub- 
graph of Z d , it has uniform polynomial growth and H n < C\ log n for every n. Theorem 7 
implies that E[A n (p, ^i) 2 ] < Ci\n for an infinite number of n. Using Fatou's lemma, we 
obtain that E[A 00 (p, Xi) 2 ] < oo. Thus we may use Lemma 18 for the function / = A 2 ^ 
and get (with the fact that B' has radius 4cm) , 

E„[|/i(X (£n)2 ) - h\l g c] 2 < c 8 — n 2 . 
Putting together the estimates for the two terms, we obtain 

2 ( E 2 5 d+2 

(h(a)-h(b)) <n 2 (c 3 — + c 8 — 

which implies the claim provided 5 = £( d+1 )/( d+2 ). □ 

A final tool for the proof of Theorem 5 is a slight strengthening of Theorem 3 for the 
case of percolation. Theorem 3 required the function to be sublinear at every scale, but 
here it will be important to relax this assumption to work at only an (arbitrarily sparse) 
sequence of scales. 

Lemma 20. For supercritical percolation, H n - if n _i < C/n for every n, where C is a 
constant depending only on d and p. 

(again, in Section 2 we showed that it holds for a sequence of n, here we need the 
claim for all n). 

Proof. The heat kernel estimates (1) show, after a little calculation, that 

H n = ^logn + 0(l) \/n>n (u). (28) 

For n < n (u) we can use a much rougher bound, say H n < rflog(2n) which follows from the 
fact that for any cluster u the distribution of R n is supported on the cube {-n, . . . , n} d and 
any measure has entropy smaller than the entropy of the uniform measure on its support. 
Since no(cu) has a stretched exponential tail, we can integrate over the environment and 
get that H n = | logn + 0(1). This means that H n - H n / 2 < C for some C. Using the fact 
that H n - H n _i is decreasing (Corollary 9 on page 10) proves the lemma. □ 

Corollary 21. With probability 1, every harmonic function h on a supercritical percola- 
tion cluster which satisfies that for some n k -> oo ; 

E[h(X nk ) 2 ] = o(n k ) 

is constant. 
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Proof. The proof is identical to that of Theorem 3. Equation (11) still holds for every n: 



E p \h(p) - h(X t )\ < ^/2E p [A n (p,X i y]E p [h^X n )]. 

We know from the previous lemma that H n - H n _i < C/n for all n, and in particular for 
the n k for which we have E[h(X nk ) 2 ] = o(n k ). By Theorem 7 this gives 

EfA^p,*!) 2 ] <C/n k . 

Using Fatou's lemma we know that there is some random c and a random subsequence 
n' k of rik for which 

E[A n , k (p,X 1 ) 2 ]<c/n k . 

so E p \h{p) - h(Xi)\ = o(l) as k -> oo, but since it does not depend on k it must be 
zero. So h(p) = h(Xi) almost surely. By stationarity, h(X n ) = h(X n+ i) almost surely, 
since h, shifted to X n , still has a subsequence of times n k where it is sublinear. Hence 
h(p) = h(X n ) for every n, almost surely. By irreducibility, it is constant. □ 

Proof of Theorem 5. Let d > 2, the constant functions on to are obviously harmonic. The 
projections of x + x( x ) where % is the corrector (see Theorem 15) on each coordinate 
provide us with d linearly independent functions. These functions have linear growth. 
Therefore, the space of linear growth harmonic functions is at least (d+ l)-dimensional. 

Now, let h be a harmonic function on ui with (at most) linear growth and with h(0) = 0. 
Proposition 19 allows us to extract a subsequence of (h Hk ) converging uniformly on any 
compact subset of IR d to a continuous function h. Assume for a moment that h is proved 
to be linear. Then, f(x) ■= h(x) - h(x + x( x )) is a linear harmonic function on oo with the 
following additional property: for every e there is a k Q such that for all k > k , 

\f(x)\ < en k Vx with Id < -n k . 

£ 

i.e. it is sublinear on a sequence of (space) scales. A simple calculation with the Gaussian 
upper bounds (22) shows that it is also sublinear on a sequence of time scales i.e. 

E[f(X nl ) 2 ]<en 2 k \/k>k' . 

By Corollary 21, it must be constant. Since /(0) = that constant is zero and h = h o ^. 
Therefore, any harmonic function with growth at most linear and equal to at belongs 
to a vector space of dimension d and the result follows. In conclusion, it is sufficient to 
prove that h is linear, or equivalently that it is harmonic, since it is well known that 
harmonic functions with at most linear growth on M. d are the affine maps (take the partial 
derivative along one direction, it is a bounded harmonic map on M, d , and thus a constant 
map). 

Let now B t be Brownian motion. Our first goal is to examine Brownian motion starting 
from 0, namely, we wish to show 

E [h(B t )] = h(0) Vt>0. (29) 
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To see (29) note that h is harmonic and hence E p [h(X t )] = h(p) or equivalently 

E [h nk (X n 2 t /n k )] = h nk (0). 

The central limit theorem (Theorem 15) allows to control h(X t ) in a ball of radius « \/t. 
Namely, because (X n 2 t /n) converges weakly to B t , 

E [h(X n 2 t /n k ) ■ l\x nlt /n k \<K] - E [h(B t ) ■ l\B t \< K ]\ < e{K) + o(l) 

where e(K) goes to as K -> oo and the o(l) is as k -> oo. The Gaussian bounds (22) 
and the linear bound on /i nfe and ft, allow to control h(X t ) outside that ball 

| E o[^n fe (^„2 t /n fc ) • l\X n 2 t /n k \>K 

and similarly for h(B t ). This shows (29). 

We now extend (29) from to all points u using Lemma 18. Let us recall that weak 
convergence is metrizable. For example, it is equivalent to convergence in the Levy- 
Prokhorov distance metric [33, Section 11.3, especially Theorem 11.3.3]. We will not 
need any property of the Levy-Prokhorov metric except that it is equivalent to weak 
convergence. 

Now fix t, and fix also some e and some no- Consider a vertex x in the cluster to 
be good if the Gaussian estimates (22) hold for all n > no and if the Levy-Prokhorov 
distance between X n 2 t /n (started from x) and B t (started from x/n) is smaller than e. 
If no is sufficiently large (depending on t and e), the probability of x to be good will be 
> 1 - e. For the Gaussian estimates this follows directly from (22) while for the Levy- 
Prokhorov distance this follows from the equivalence of Levy-Prokhorov convergence and 
weak convergence. Fix therefore no to satisfy this property. 

Now use Lemma 18 with the function / being f(x,y) = 1{ X is bad} (the y variable is 
simply ignored) and with some arbitrary A. We get that for sufficiently large n, the 
number of bad x in B p (An) is bounded by C(A)n d P(0 is bad) < C{\)en d . Define 

B n := {u € R d : \u\ < A, un is bad} 

(where as usual we in fact take the point of the infinite cluster closest to un and check 
whether it is bad). Since the measure of B n is < Cs, we see that, except for a set of 
measure < Ce every u e R d with \u\ < A satisfies that un is good for infinitely many n. 
But e (both for the error and for the measure of the bad set) was arbitrary. Taking e -> 
and then A -> oo we see that for almost every itel d there is a sequence n' k - n' k (u) (a 
subsequence of n k ) such that 

1. The Gaussian estimates hold for X n > k started from un' k 

2. The Levy-Prokhorov distance between X^ n ^2 t /n' k started from un' k and Brownian 
motion started from u goes to zero. 



< e(K) \fn sufficiently large 
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Using again the equivalence of Levy-Prokhorov convergence and weak convergence we get 
that random walk started from un' k converges to Brownian motion started from u. We 
can now repeat the argument that led to (29) literally and get 

E u [h(B t )] = h(u) 

for almost every u. Since h is continuous, this in fact holds everywhere. Since t was 
arbitrary, h(B t ) is a continuous martingale, from any starting point. 

The theorem is now proved. Using the strong Markov property we get that h(u) is 
equal to its average over a sphere of arbitrary radius around u, in other words, we have 
established the mean-value property hence h is harmonic. □ 

A natural extension of the supercritical bond percolation setting is to look at random 
environments on Z d , such as random conductance with uniformly elliptic conditions. See 
[67] for the existence of the corrector in this case. Similar results can probably be obtained 
in this setting. 



5 Heat kernel derivative estimates 

Our purpose in this section is to prove Theorem 6 which gives an upper bound for the 
(discrete) derivative of the heat kernel, p n (x, y) -p n _i(x' ,y), for x ~ x', where p n (x,y) := 
P x (X n = y). 

We start with a lemma true on any graph. It relates the infinity norm of the gradient 
of the heat kernel to the infinity norm of the heat kernel and the entropy. 

Lemma 22. Let G be a graph of maximal degree d, for any x,x',y e G then with x ~ x' , 

(P2n(XiV) -P2n-l(x',y)f < 

< Ad{d+ 1) • A n (x,x') 2 • max p n (a,b) • max p n (a, b) (30) 

a,beB x (2n): a,beB x (2n) 
d(o,6)>d(x,j/)/2 

where A n is defined in (6). 

Proof. We know by Markov's property that 

P2n(x,y) -P2n-i(x',y) = £ (p n (x, a) - Pn-i(x', o))p n (a, y). 

Let us split the sum on a 6 G into two sums I + II, where I is the sum over a e 
I5 x (d(x,y)/2), and /J on the remaining a. Using Cauchy-Schwarz we can write 

J 2 < X! (Pn(x,a)-p n _ 1 (x',a)) 2 \i £ Pn(a,y) 2 

\aeB x (d(x,y)/2) ) \o6B lc (d(i,y)/2) 

For the first term, bound the denominator in the definition of A by its maximum and get 
Y (p n (x,a) -p n _i(x',a)) 2 < A n (x,x') 2 ■ max {p n (x, a) + p n -i(x', a)} 

aeB x (d(x,y)/2) aeB x (d(x,y)/2) 
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For the second term write 

Y, Pn(a,y) 2 <( max p n (a,y))-\ £ Pn(a,y) 

aeB x (d(x,y)/2) aeB x (d(x,y)/2) \aeB x (d(x,y)/2) 

«B B (d( a;i y)/2) \« B<B (d(i )W )/2) 

< d-( max p n (a, w)). 

Together we get 

I 2 < d- A n (x,x') 2 ■ max {p n (x, a) + p n -i(x', a)} ■ max p n (a,y). (31) 

asB x (d(x,j/)/2) aeB^(d(a;,j/)/2) 

Now, the second maximum in the right-hand side of (31) is a maximum on a smaller set 
than the first maximum in (30) (note that points in TS x (d(x,y)/2) are at distance larger 
than d(x,y)/2 from y). Similarly, the first maximum is smaller than (1 + d) times the 
second maximum of (30). Therefore, the product of maxima is smaller than 

(d+l)- max p n (a,b) ■ max p n (a,b). 

a,beB x (2n): a,beB x (2n) 
d(a,b)>d(x,y)/2 

The estimate for II is the similar 

II 2 < d ■ A n (x, x') 2 ■ max {p„(i, a) + p n _i(V, a)} • max p n (a,y). 

HK ' a{B x (d(x,y)/2) l ^ nK ' Vn U ,S a{B x (d(x,y)/2) ^ ^' 

It is easy to obtain the same bound again, except the estimates are reversed (i.e. what 
was bounded by the first term before is now bounded by the second term). We sum up: 

(p 2n (x, y) - p 2n -i(x', y)) 2 = (I + II) 2 < 2(I 2 + II 2 ) 

< 4d(d + 1) • A n (x, x') 2 ■ max p n (a,b) ■ max p n (a,b). □ 

a,beB x (2n): a,beB x (2n) 
d(a,b)>d(x,y)/2 

Proof of Theorem 6. As before, percolation can be seen as a stationary random graph, 
and it is sufficient to prove 

C 

E( (p 2 n(p,a:)-p 2n _i(A > 1 ,x)) 2 -l {:c ^ } ) < ^ r exp(-Cl|x| 2 /n) 

where C% and C 4 depend only on d and the percolation probability p. Also note that one 
can restrict ourselves to \x\ < n 1 / 2+£ / 3 , since in the regime \x\ > n 1 / 2+£ / 3 , the heat kernel 
decreases fast enough so that one can tune the constant C' A in order to obtain the result 
for free. 

Again we use the variables n y (u) from (1,2). Take e to be given by the stretched 
exponential bound for n y (u), (2). Fix |x| < n 1 / 2+£ / 3 . Let N(u) = m&x{n y (uj) : y € B p (n)}. 
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So the Gaussian estimates (1) imply that for a.e. environment u such that x e u, whenever 
n > N(u), we have 

C C 
max p n ( a i b) < — ttt exp [ - C 4 lx| 2 /nl and max p n (a, b) < — -j-. (32) 

a,beB (n): n d ' 2 L a,beB p (n) n d ' 2 

d(a,b)>d(p,x)/2 

Averaging (30) on the environments satisfying N(ou) < n (for which we have (32)), we find 

^[(P2n(p,x) -p 2 „-i(Xi,a;)) 2 • l{ x€a ,}l{jV(u,)< n }] 

f 2 

< Ad{d + 1) • E[A n (p, Xi ) 2 ] • -| exp [ - C\\x\ 2 /n] . 

We now apply Theorem 7 to bound E[A 2 ] by 2(H n - H n -i). Recall also that Lemma 
20 from the previous section says that for critical percolation H n - H n -\ < C/n for all n. 
Together these give 

C ■ C 2 

E[(p 2 „(p,x) -p 2 „-i(^i,a:)) 2 • l{xsu;}l{iv(u,)<n}] <4rf(rf+ 1) d+1 3 exp[-C 4 lxl 2 /n]. 

We do not need to control the behavior of the gradient on {N(u) > n} since this event 
has probability at most Cn d e~ n "'. Hence in the regime |x| < n l l 2+£ l ? > we find 

E[(p 2 „(/9,x) -p 2 n-l(^l,x)) 2 • l{x€w}l{iV( W )>n}] 

C 2 

<¥ p (N(u) >n) < Ad(d+1)-—^ exp [-C 4 \x\ 2 /n]. 

Putting all the pieces together, we obtain the result. □ 

The proof involved only Gaussian estimates at mesoscopic scale and the entropy argu- 
ment. It extends to other contexts such as random conductances satisfying the uniform 
elliptic condition (see Example 2.1). One may then get, using convolution, annealed sec- 
ond space-derivative and first time- derivative estimates for the heat kernel using the first 
space-derivative estimates. We refer to Section 5 of [29] for more details. 

6 Open questions 

This study must be understood as an introduction and some initial steps in the subject. 
There are many natural questions on harmonic functions which remain open. We present 
few of them in this section. 

Minimal growth harmonic functions. The question of minimal growth harmonic 
functions was implicitly studied in the literature: the failure of the Liouville property 
corresponds to a special case of minimal growth. When the the Liouville property is true, 
it becomes interesting to determine the minimal growth. Even the deterministic case 
(i.e. transitive or Cayley graphs) has interesting phenomenology and we plan to analyze 
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some examples in a future paper. Note that groups always admit linear growth harmonic 
functions [51, 66, 68]. This is no longer the case for stationary random graphs. When 
the random walk is subdiffusive (note that the random walk on Cayley graphs is at least 
diffusive, a result due to Erschler, see Lee and Peres [55]), Theorem 3' (page 16) implies 
a phenomenon which is specific to random environments: 

Proposition 23. Let (G, u, p) be a stationary random graph with polynomial growth such 
that the random walk is (strictly) subdiffusive. Then, almost surely there do not exist 
linear growth harmonic functions. 

Therefore graphical fractals, UIPQ, critical Galton- Watson trees conditioned to sur- 
vive and the incipient infinite cluster (IIC) do not admit linear growth harmonic functions. 
We mention that it was already proved [9] that the Uniform Infinite Planar Triangulation 
is almost surely Liouville. There are no non-constant harmonic functions on the critical 
Galton- Watson tree or on the IIC as both have infinitely many cut vertices. Indeed, the 
Galton- Watson tree is well-known to be one-ended and hence, as a tree, must have in- 
finitely many cut vertices. The existence of cut points for the IIC is essentially known, 
but we did not find a reference and including a full proof would take us too far off-topic. 

Question 1. Do there exist non-constant harmonic functions with polynomial growth on 
the UIPQ? 

If such functions exist, we may ask the following question: 

Question 2. What is the minimal growth of a non-constant harmonic function on the 
UIPQ? 



Space of harmonic functions with polynomial growth. Cayley graphs with poly- 
nomial growth automatically satisfy the volume doubling property and the Poincare in- 
equality, thus implying that spaces of harmonic functions with prescribed polynomial 
growth are finite dimensional. The possibility of such behavior in the case of stationary 
random graphs of polynomial volume growth is a legitimate question. 

Question 3. Let (G,u,p) be a stationary random graph with polynomial growth. Is the 
space of harmonic functions with prescribed polynomial growth finite dimensional? 

The difficulty comes from the fact that we do not necessarily have Poincare inequality 
at our disposal (in the case of the UIPQ for instance). Therefore, we cannot use the 
technology developed in Section 3 in the general context. We mention that there exists 
another strategy to prove finite dimensionality, proposed in [27], relying on the following 
weaker statement: for every harmonic function h on G and x e G: 

h2{x) - u(B°(n)) £ h2 (y>(y)- 

where is C a constant independent of x and n. This inequality appears in standard 
proofs of elliptic Harnack inequalities and holds for a larger class than those satisfying 
the doubling volume property and Poincare inequality. Still one cannot expect it to hold 
in graphs with small "bottlenecks" like the UIPQ. 
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Question 4. Is the space of harmonic functions with some prescribed polynomial growth 
on the UIPQ finite dimensional? 

Dimension of spaces of harmonic functions. The computation of the dimension 
of spaces of harmonic functions does not restrict to the case of linear growth harmonic 
functions. For a graph G and k > 0, let c4[G] be the dimension of the space of harmonic 
functions with growth bounded by a polynomial of degree k. 

The similarity between Z d and the infinite cluster of percolation might extend to the 
dimension of the space of harmonic functions with arbitrary polynomial growth. More 
precisely, we ask the following question: 

Question 5. Do the families (c4[w])fc>o and ((4[Z d ]) k>o have equal dimension almost 
surely? 

In particular, an interesting intermediate step toward this question would be to show 
that there is no harmonic function with non-integer growth. 

It is natural to ask if an invariance principle for the random walk in the random envi- 
ronment oj implies that the sequence coincides with (c4[Z d ]). On Z d , diffusivity 
and the invariance principle are robust under rough isometry. Therefore, one can ask if 
(dk[G])k>o is invariant under rough isometry for these kind of graphs. This is not true in 
general. For instance, Liouville property is not invariant under rough isometry (see [57] 
for the first example or [12] for a simpler one). 

More generally, one can ask whether a small perturbation of a Cayley graph modifies 
drastically the harmonic functions on it. For instance, consider percolation on a Cayley 
graph G such that p u (G) (the infimum of the values for which there exists a unique infinite 
cluster) is strictly smaller than 1. Fix p > p u (G) and set u(G) to be the unique infinite 
cluster of the percolation with parameter p. 

Question 6. Are the dimensions of spaces of harmonic functions with a given growth equal 
for G and cu(G)? 

Note that the question, in the case of bounded harmonic functions, was addressed in 
[11]. 

In the context of Cayley graphs, the space of harmonic functions with a certain growth 
rate is crucial in the study of the underlying group. Indeed, the latter acts on harmonic 
functions naturally. In the random setting, we do not have this interpretation. Neverthe- 
less, an interesting question is to understand what information on the random graph is 
encoded in the sequence (dk[G])k>o- In particular, the following question would be a first 
step in this direction: 

Question 7. Consider a random subgraph G of 7L d . What are the requirements to ensure 
that (d k [G])k>o equals (4[Z d ]) fc > ? 
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